SEQUENCE LISTING 




™ CENTER m2m 



RECEIVED 



JAN 1 7 2002 



<110> HERMONAT, Paul L. 



<120> ADENO -ASSOCIATED VIRUS AAV REP78 MAJOR REGULATORY PROTEIN, MUTANTS 
THEREOF AND USES THEREOF 

<130> 023533/0130 

<140> US 09/693,908 

<141> 2000-10-23 

<150> US 60/160,608 

<151> 1999-10-21 

<160> 6 

<170> Patentln version 3.1 

<210> 1 

<211> 26 

<212> DNA 

<213> Unknown 

<220> 

<223> Primer 

<400> 1 

ccccggaggc cgaattcttt gtgcaa 2 6 



<210> 2 

<211> 20 

<212> DNA 

< 2 1 3 > Un kn own 

<220> 

<22 3> Primer 



<210> 3 

<211> 20 

<212> DNA 

< 2 1 3 > Un kn own 

<220> 

<223> Primer 

<400> 3 

catatcacca gctcaccgtc 20 



<400> 2 

acaagcagga ttgaaggcca 



20 




<210> 4 
<211> 110 
<212> DNA 

<213> Human papillomavirus 
<400> 4 

actacaataa ttcatgtata aaactaaggg cgtaaccgaa atcggttgaa ccgaaaccgg 6 0 

ttagtataaa agcagacatt ttatgcacca aaagagaact gcaatgtttc 110 



<210> 5 

<211> 4675 

<212> DNA 

<213> adeno-associated virus 2 
<220> 

<221> CDS 

<222> (321) . . (2183) 

<223> 

<400> 5 

ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60 

cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 12 0 

gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag 180 

ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat 240 

gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga 300 

ggtttgaacg cgcagccgcc atg ccg ggg ttt tac gag att gtg att aag gtc 353 

Met Pro Gly Phe Tyr Glu lie Val lie Lys Val 
15 10 

ccc age gac ctt gac ggg cat ctg ccc ggc att tct gac age ttt gtg 401 
Pro Ser Asp Leu Asp Gly His Leu Pro Gly lie Ser Asp Ser Phe Val 
15 20 25 

aac tgg gtg gec gag aag gaa tgg gag ttg ccg cca gat tct gac atg 449 
Asn Trp Val Ala Glu Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met 
30 35 40 

gat ctg aat ctg att gag cag gca ccc ctg acc gtg gec gag aag ctg 497 
Asp Leu Asn Leu lie Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu 
45 50 55 

cag cgc gac ttt ctg acg gaa tgg cgc cgt gtg agt aag gec ccg gag 545 
Gin Arg Asp Phe Leu Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu 
60 65 70 75 

gec ctt ttc ttt gtg caa ttt gag aag gga gag age tac ttc cac atg 593 
Ala Leu Phe Phe Val Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Met 
80 85 90 



cac gtg etc gtg gaa accacc ggg gtg aaa tec atg gtt ttg gga cgt 641 

His Val Leu Val Glu Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg 
95 100 105 

ttc ctg agt cag att cgc gaa aaa ctg att cag aga att tac cgc ggg 689 

Phe Leu Ser Gin lie Arg Glu Lys Leu lie Gin Arg lie Tyr Arg Gly 
110 115 120 

ate gag ccg act ttg cca aac tgg ttc gcg gtc aca aag acc aga aat 737 

lie Glu Pro Thr Leu Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn 
125 130 135 

ggc gee gga ggc ggg aac aag gtg gtg gat gag tgc tac ate ccc aat 7 85 

Gly Ala Gly Gly Gly Asn Lys Val Val Asp Glu Cys Tyr lie Pro Asn 
140 145 150 155 

tac ttg etc ccc aaa acc cag cct gag etc cag tgg gcg tgg act aat 833 

Tyr Leu Leu Pro Lys Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn 

160 165 170 

atg gaa cag tat tta age gee tgt ttg aat etc acg gag cgt aaa egg 881 

Met Glu Gin Tyr Leu Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg 
175 180 185 

ttg gtg gcg cag cat ctg acg cac gtg teg cag acg cag gag cag aac 92 9 

Leu Val Ala Gin His Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn 
190 195 200 

aaa gag aat cag aat ccc aat tct gat gcg ccg gtg ate aga tea aaa 977 

Lys Glu Asn Gin Asn Pro Asn Ser Asp Ala Pro Val lie Arg Ser Lys 
205 210 215 

act tea gee agg tac atg gag ctg gtc ggg tgg etc gtg gac aag ggg 1025 

Thr Ser Ala Arg Tyr Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly 
220 225 230 235 

att acc teg gag aag cag tgg ate cag gag gac cag gee tea tac ate 1073 

lie Thr Ser Glu Lys Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie 

240 245 250 

tec ttc aat gcg gee tec aac teg egg tec caa ate aag get gee ttg 1121 

Ser Phe Asn Ala Ala Ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu 
255 260 265 

gac aat gcg gga aag att atg age ctg act aaa acc gee ccc gac tac 1169 

Asp Asn Ala Gly Lys lie Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr 
270 275 280 

ctg gtg ggc cag cag ccc gtg gag gac att tec age aat egg att tat 1217 

Leu Val Gly Gin Gin Pro Val Glu Asp lie Ser Ser Asn Arg He Tyr 
285 290 295 

aaa att ttg gaa eta aac ggg -tac gat ccc caa tat gcg get tec gtc 12 65 

Lys He Leu Glu Leu Asn Gly Tyr Asp Pro Gin Tyr Ala Ala Ser Val 
300 305 310 315 



7* 



a 



ttt ctg gga tgg gcc acg aaa aag ttc ggc aag agg aac acc ate tgg 1313 
Phe Leu Gly Trp Ala Thr Lys Lys Phe Gly Lys Arg Asn Thr lie Trp 
320 325 330 

ctg ttt ggg cct gca act acc ggg aag acc aac ate gcg gag gcc at a 1361 
Leu Phe Gly Pro Ala Thr Thr Gly Lys Thr Asn lie Ala Glu Ala He 
335 340 345 

gcc cac act gtg ccc ttc tac ggg tgc gta aac tgg acc aat gag aac 1409 
Ala His Thr Val Pro Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn 
350 355 360 

ttt ccc ttc aac gac tgt gtc gac aag atg gtg ate tgg tgg gag gag 1457 
Phe Pro Phe Asn Asp Cys Val Asp Lys Met Val He Trp Trp Glu Glu 
365 370 375 

ggg aag atg acc gcc aag gtc gtg gag teg gcc aaa gcc att etc gga 1505 
Gly Lys Met Thr Ala Lys Val Val Glu Ser Ala Lys Ala He Leu Gly 
380 385 390 395 

gga age aag gtg cgc gtg gac cag aaa tgc aag tec teg gcc cag ata 1553 
Gly Ser Lys Val Arg Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He 
400 405 410 

gac ccg act ccc gtg ate gtc acc tec aac acc aac atg tgc gcc gtg 1601 
Asp Pro Thr Pro Val He Val Thr Ser Asn Thr Asn Met Cys Ala Val 
415 420 425 

att gac ggg aac tea acg acc ttc gaa cac cag cag ccg ttg caa gac 1649 
He Asp Gly Asn Ser Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp 
430 435 440 

egg atg ttc aaa ttt gaa etc acc cgc cgt ctg gat cat gac ttt ggg 1697 
Arg Met Phe Lys Phe Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly 
445 450 455 

aag gtc acc aag cag gaa gtc aaa gac ttt ttc egg tgg gca aag gat 1745 
Lys Val Thr Lys Gin Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp 
460 465 470 475 

cac gtg gtt gag gtg gag cat gaa ttc tac gtc aaa aag ggt gga gcc 1793 
His Val Val Glu Val Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala 
480 485 490 

aag aaa aga ccc gcc ccc agt gac gca gat ata agt gag ccc aaa egg 1841 
Lys Lys Arg Pro Ala Pro Ser Asp Ala Asp He Ser Glu Pro Lys Arg 
495 500 505 

gtg cgc gag tea gtt gcg cag cca teg acg tea gac gcg gaa get teg 1889 
Val Arg Glu Ser Val Ala Gin Pro Ser Thr Ser Asp Ala Glu Ala Ser 
510 515 520 

ate aac tac gca gac agg tac caa aac aaa tgt tct cgt cac gtg ggc 1937 
He Asn Tyr Ala Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Val Gly 
525 530 535 



4 



atg aat ctg atg ctg ttt ccc tgc aga caa tgc gag aga atg aat cag 1985 
Met Asn Leu Met Leu Phe Pro Cys Arg Gin Cys Glu Arg Met Asn Gin 
540 545 550 555 

aat tea aat ate tgc ttc act cac gga cag aaa gac tgt tta gag tgc 2033 
Asn Ser Asn lie Cys Phe Thr His Gly Gin Lys Asp Cys Leu Glu Cys 
560 565 570 

ttt ccc gtg tea gaa tct caa ccc gtt tct gtc gtc aaa aag gcg tat 2081 
Phe Pro Val Ser Glu Ser Gin Pro Val Ser Val Val Lys Lys Ala Tyr 
575 580 585 

cag aaa ctg tgc tac att cat cat ate atg gga aag gtg cca gac get 212 9 
Gin Lys Leu Cys Tyr lie His His lie Met Gly Lys Val Pro Asp Ala 
590 595 600 

tgc act gee tgc gat ctg gtc aat gtg gat ttg gat gac tgc ate ttt 2177 
Cys Thr Ala Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys lie Phe 
605 610 615 

gaa caa taaatgattt aaatcaggta tggctgeega tggttatctt ccagattggc 22 33 

Glu Gin 

620 

tcgaggacac tctctctgaa ggaataagac agtggtggaa gctcaaacct ggcccaccac 22 93 

caccaaagcc egcagagegg cataaggacg acagcagggg tcttgtgctt cctgggtaca 23 53 

agtacctegg acccttcaac ggactcgaca agggagagee ggtcaacgag gcagacgccg 2413 

cggccctcga geaegtacaa agcctacgac cggcagctcg acageggaga caacccgtac 2473 

ctcaagtaca accacgccga cgcggagttt caggagcgcc ttaaagaaga tacgtctttt 2533 

gggggcaacc teggacgage agtcttccag gcgaaaaaga gggttcttga acctctgggc 2593 

ctggttgagg aacctgttaa gacggctccg ggaaaaaaga ggceggtaga gcactctcct 2 653 

gtggagccag actcctcctc gggaaccgga aaggegggee agcagcctgc aagaaaaaga 2713 

ttgaattttg gtcagactgg agaegcagae tcagtacctg acccccagcc teteggacag 2 773 

ccaccagcag ccccctctgg tctgggaact aatacgatgg ctacaggcag tggcgcacca 2 833 

atggcagaca ataacgaggg cgccgacgga gtgggtaatt cctccggaaa ttggcattgc 2 893 

gattccacat ggatgggcga cagagtcatc accaccagca cccgaacctg ggccctgccc 2 953 

acctacaaca accacctcta caaacaaatt tccagccaat caggagcctc gaacgacaat 3 013 

cactactttg gctacagcac cccttggggg tattttgact tcaacagatt ccactgccac 3 073 

ttttcaccac gtgactggca aagactcatc aacaacaact ggggattccg acccaagaga 3133 

ctcaacttca agctctttaa cattcaagtc aaagaggtca cgcagaatga eggtacgacg 3193 



•{<> 

z 6 



acgattgcca ataaecttac £agcacggtt caggtgttta ctgactcgga gtaccagctc 3253 

ccgtacgtcc tcggctcggc gcatcaagga tgcctcccgc cgttcccagc agacgtcttc 3313 

atggtgccac agtatggata cctcaccctg aacaacggga gtcaggcagt aggacgctct 3 373 

tcattttact gcctggagta ctttccttct cagatgctgc gtaccggaaa caactttacc 3433 

ttcagctaca cttttgagga cgttcctttc cacagcagct acgctcacag ccagagtctg 3493 

gaccgtctca tgaatcctct catcgaccag tacctgtatt acttgagcag aacaaacact 3 553 

ccaagtggaa ccaccacgca gtcaaggctt cagttttctc aggccggagc gagtgacatt 3 613 

cgggaccagt ctaggaactg gcttcctgga ccctgttacc gccagcagcg agtatcaaag 3 673 

acatctgcgg ataacaacaa cagtgaatac tcgtggactg gagctaccaa gtaccacctc 3733 

aatggcagag actctctggt gaatccggcc atggcaagcc acaaggacga tgaagaaaag 37 93 

ttttttcctc agagcggggt tctcatcttt gggaagcaag gctcagagaa aacaaatgtg 3 853 

aacattgaaa aggtcatgat tacagacgaa gaggaaatcg gaacaaccaa tcccgtggct 3913 

acggagcagt atggttctgt atctaccaac ctccagagag gcaacagaca agcagctacc 3973 

gcagatgtca acacacaagg cgttcttcca ggcatggtct ggcaggacag agatgtgtac 4033 

cttcaggggc ccatctgggc aaagattcca cacacggacg gacattttca cccctctccc 4093 

ctcatgggtg gattcggact taaacaccct cctccacaga ttctcatcaa gaacaccccg 4153 

gtacctgcga atccttcgac caccttcagt gcggcaaagt ttgcttcctt catcacacag 4213 

tactccacgg gacacggtca gcgtggagat cgagtgggag ctgcagaagg aaaacagcaa 4273 

acgctggaat cccgaaattc agtacacttc caactacaac aagtctgtta atcgtggact 4333 

taccgtggat actaatggcg tgtattcaga gcctcgcccc attggcacca gatacctgac 43 93 

tcgtaatctg taattgcttg ttaatcaata aaccgtttaa ttcgtttcag ttgaactttg 4453 

gtctctgcgt atttctttct tatctagttt ccatggctac gtagataagt agcatggcgg 4513 

gttaatcatt aactacaagg aacccctagt gatggagttg gccactccct ctctgcgcgc 4573 

tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc 4633 

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aa 4675 

<210> 6 
<211> 621 
<212> PRT 

<213> adeno-associated virus 2 

*6 



+ 



<400> 6 

Met Pro Gly Phe Tyr Glu lie Val lie Lys Val Pro Ser Asp Leu Asp 
15 10 15 



Gly His Leu Pro Gly lie Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 



Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu lie 
35 40 45 



Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 



Thr Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 



Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Met His Val Leu Val Glu 
85 90 95 



Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin lie 
100 105 110 



Arg Glu Lys Leu lie Gin Arg lie Tyr Arg Gly lie Glu Pro Thr Leu 
115 120 125 



Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 



Asn Lys Val Val Asp Glu Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 



Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Gin Tyr Leu 
165 170 175 



Ser Ala Cys Leu Asn Leu Thr Glu Arg Lys Arg Leu Val Ala Gin His 
180 185 190 



Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Gin Asn 
195 200 205 



* 



Pro Asn Ser Asp- Ala Pro Val lie Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 



Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly lie Thr Ser Glu Lys 
225 230 235 240 



Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie Ser Phe Asn Ala Ala 
245 250 255 



Ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 



lie Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gin Gin 
275 280 285 



Pro Val Glu Asp lie Ser Ser Asn Arg lie Tyr Lys lie Leu Glu Leu 
290 295 300 



Asn Gly Tyr Asp Pro -Gin Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 



Thr Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 
325 330 335 



Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Thr Val Pro 
340 345 350 



Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 365 



Cys Val Asp Lys Met Val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 



Lys Val Val Glu Ser Ala Lys Ala lie Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 



Val Asp Gin Lys Cys Lys Ser Ser Ala Gin lie Asp Pro Thr Pro Val 
405 410 415 



lie Val Thr Ser Asn Thr Asn Met Cys Ala Val lie Asp Gly Asn Ser 
420 425 430 



Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
435 440 445 



Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gin 
450 455 460 



Glu Val Lys Asp Phe Phe Arg Trp Ala Lys Asp His Val Val Glu Val 
465 470 475 480 



Glu His Glu Phe Tyr Val Lys Lys Gly Gly Ala Lys Lys Arg Pro Ala 
485 490 495 



Pro Ser Asp Ala Asp lie Ser Glu Pro Lys Arg Val Arg Glu Ser Val 
500 505 510 



Ala Gin Pro Ser Thr Ser Asp Ala Glu Ala Ser lie Asn Tyr Ala Asp 
515 520 525 



Arg Tyr Gin Asn Lys Cys Ser Arg His Val Gly Met Asn Leu Met Leu 
530 535 540 



Phe Pro Cys Arg Gin Cys Glu Arg Met Asn Gin Asn Ser Asn lie Cys 
545 550 555 560 



Phe Thr His Gly Gin Lys Asp Cys Leu Glu Cys Phe Pro Val Ser Glu 
565 570 575 



Ser Gin Pro Val Ser Val Val Lys Lys Ala Tyr Gin Lys Leu Cys Tyr 
580 585 590 



lie His His lie Met Gly Lys Val Pro Asp Ala Cys Thr Ala Cys Asp 
595 600 605 



Leu Val Asn Val Asp Leu Asp Asp Cys lie Phe Glu Gin 
610 615 620 



